Filter Tokens (by Region) (Text Processing)
Synopsis
Filters tokens based on the region around another token.
Description
This operator keeps only the tokens that lie within a region around a specified token. Overlapping regions are kept as a whole: if the token occurs several times, the maximal region is built around each occurrence and the union of those regions is delivered.
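The region logic described above can be sketched in a few lines of Python. This is only an illustration, not the operator's actual implementation: the function name `filter_tokens_by_region` and the `is_anchor` callback are hypothetical, and the sketch assumes that the matching token itself is kept as part of its region.

```python
from typing import Callable, List


def filter_tokens_by_region(
    tokens: List[str],
    is_anchor: Callable[[str], bool],
    tokens_before: int,
    tokens_after: int,
) -> List[str]:
    """Keep the tokens that fall inside a window around any anchor token.

    Assumption: the anchor token itself belongs to its region.
    """
    keep = [False] * len(tokens)
    for i, token in enumerate(tokens):
        if is_anchor(token):
            # Clamp the window to the document boundaries.
            start = max(0, i - tokens_before)
            end = min(len(tokens), i + tokens_after + 1)
            for j in range(start, end):
                # Marking positions yields the union of overlapping regions,
                # so a token covered by two regions is kept only once.
                keep[j] = True
    return [t for t, k in zip(tokens, keep) if k]


tokens = "the quick brown fox jumps over the lazy dog".split()
# "fox" and "over" are anchors; their regions overlap at "jumps".
region = filter_tokens_by_region(tokens, lambda t: t in {"fox", "over"}, 1, 1)
print(region)  # ['brown', 'fox', 'jumps', 'over', 'the']
```

Marking positions in a boolean mask, rather than collecting slices, is one simple way to obtain the union without duplicating tokens where regions overlap.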
Input
- document
The document port.
Output
- document
The document port.
Parameters
- condition
The condition a document must fulfill to be kept. Range:
- string
The string that the token should be compared to. Range:
- regular_expression
The regular expression that should match. Range:
- case_sensitive
Specifies whether the comparison should be case-sensitive. Range:
- invert condition
Specifies whether the comparison outcome should be inverted. Range:
- tokens_before
The maximum number of tokens kept before the specified token. Range:
- tokens_after
The maximum number of tokens kept after the specified token. Range:
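To illustrate how the comparison parameters might interact, here is a hedged Python sketch of a token matcher. The helper `make_token_matcher` is hypothetical. Because the available values of the condition parameter are not listed here, the sketch assumes just two modes: exact equality against a string, or a full match against a regular expression. The operator's real semantics may differ.

```python
import re
from typing import Callable, Optional


def make_token_matcher(
    string: Optional[str] = None,
    regular_expression: Optional[str] = None,
    case_sensitive: bool = False,
    invert_condition: bool = False,
) -> Callable[[str], bool]:
    """Build a predicate that decides whether a token is the specified token.

    Assumption: exactly one of `string` or `regular_expression` is given.
    """
    if (string is None) == (regular_expression is None):
        raise ValueError("provide exactly one of string or regular_expression")

    if regular_expression is not None:
        flags = 0 if case_sensitive else re.IGNORECASE
        pattern = re.compile(regular_expression, flags)

        def base(token: str) -> bool:
            # Assumed mode: the whole token must match the expression.
            return pattern.fullmatch(token) is not None
    else:
        needle = string if case_sensitive else string.lower()

        def base(token: str) -> bool:
            # Assumed mode: plain equality, optionally ignoring case.
            candidate = token if case_sensitive else token.lower()
            return candidate == needle

    def matcher(token: str) -> bool:
        # Inverting flips the outcome of the comparison.
        return base(token) != invert_condition

    return matcher


is_fox = make_token_matcher(string="Fox")
print(is_fox("fox"))  # True, because case is ignored by default in this sketch
```

A predicate built this way could be passed to whatever routine selects the region, with tokens_before and tokens_after then controlling how many neighbouring tokens are kept on each side.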